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HIGH TEMPERATURE ASYMPTOTICS OF ORTHOGONAL MEAN-FIELD 

SPIN GLASSES 

BHASWAR B. BHATTACHARYA AND SUBHABRATA SEN 


Abstract. We evaluate the high temperature limit of the free energy of spin glasses on the hy¬ 
percube with Hamiltonian Hn(o) = a T Ja, where the coupling matrix J is drawn from certain 
symmetric orthogonally invariant ensembles. Our derivation relates the annealed free energy of 
these models to a spherical integral, and expresses the limit of the free energy in terms of the limit¬ 
ing spectral measure of the coupling matrix J. As an application, we derive the limiting free energy 
of the Random Orthogonal Model (ROM) at high temperatures, which confirms non-rigorous cal¬ 
culations of Marinari et al. [20]. Our methods also apply to other well-known models of disordered 
systems, including the SK and Gaussian Hopfield models. 


1. Introduction 

Consider a (random) function on the hypercube Hn ■ «SV = {—1, +1}^ —> R defined as 

Hjg(a) = a T Ja (1.1) 

with coupling matrix J = ODO T , where O is Haar distributed over the orthogonal group O(N) and 
D = diag(di, • • • , d/v) is a diagonal matrix independent of O. This defines a probability distribution 
over SV as follows: for t e Sjg and (5 > 0, 


cr = T = 


1 

w 


o 0H n (t) 


Z n (J3,0,D)’ (L2) 

where the partition function Zjg(f3, O, D) = YlaeS N ex P (/3H]\r(a)). These distributions arise fre¬ 
quently in the analysis of disordered systems in statistical physics. In this context, Hjv(cj) describes 
the energy of the configuration a, and is usually referred to as the Hamiltonian of the system. The 
parameter /3 denotes the inverse temperature, so the high temperature regime corresponds to small 
values of j3. We seek to evaluate the large N limit of the free energy 


* N {p,0,D) = -\ogZ N tf,0,D) 


(1.3) 


in these models. 

Models of the form (1.2) will be referred to as orthogonal mean-field spin glasses — they include 
many well-known physical models of disordered systems: 

(a) Sherrington-Kirkpatrick (SK) Model: In the SK model of spin glasses the coupling matrix 
J = -^=W, where W is a symmetric matrix drawn from the Gaussian Orthogonal Ensem¬ 
ble. It is well known that W = ODO T , where O ~ 0(N ) is Haar distributed and D = 
diag(di, g? 2 , • • • , cZjv) is a diagonal matrix independent of O, such that the empirical measure 
jj S di converges to the semi-circle law [ ]. The limit of the free energy for all temperatures 
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was conjectured by Parisi using deep ideas of replica symmetry breaking, and was rigorously 
established by Talagrand [ ] (refer to [22] for an introduction to this subject). Carmona and 

Hu [5] (see also Chatterjee [7]) proved that the Parisi formula continues to hold even if the 
entries of the coupling matrix J = (( Jij )) are independent mean zero random variables, subject 
to some conditions on the higher moments. 

(b) Random Orthogonal Model (ROM): Marinari et al. [20] introduced the ROM to model a deter¬ 
ministic system which exhibits glassy behavior. In this model the coupling matrix J = ODO T , 
where D = diag(di, • • • , djy) is a deterministic sequence of {±1} such that the empirical mea¬ 
sure 

1 N 

Hn(D) =—'^2S di ^pS 1 + (l-p)S- 1 , (1.4) 

2=1 

for some p E (0,1). The case p = 1/2 has received a lot of attention in the physics literature 
(see [3, , ] and the references therein). The limiting free energy of this model is not known 

rigorously even in the high temperature regime. The coupling matrix J has dependent entries 
and non-rigorous calculations based on the replica method predict different behavior compared 
to the SK model [8, 19, 20]. This suggests that comparison/universality techniques like [5, 7 
cannot be directly used to compute the free energy. 

(c) Gaussian Hop field Model: Cherrier et al. [8] considered the Gaussian Hopheld Model where 
the coupling matrix J = ^XX T , where X = (( X t] )) is a N x p matrix with i.i.d. A7(0,1). The 
coupling matrix of the usual Hopheld model has the same structure, but the matrix X consists 
of i.i.d. Rademacher {±1} random variables. Bovier et al. [ I] studied the Gaussian Hopheld 
model with 2-patterns and this “simple” case already shows highly complicated behavior. It is 
generally believed that a Hopheld model with p parameters where p ~ A N is signihcantly more 
complicated compared to the one with a finite number of patterns. 

This paper gives a general method for computing the limit of the free energy in orthogonal 
mean-held spin glass models at sufficiently high temperatures (see Theorem 1.2). Exploiting a 
connection with spherical integrals [6, 16] and using techniques from large deviations and random 
matrix theory, we rigorously justify certain heuristics employed in the traditional analyses of these 
systems. In particular, we derive: 

1. the limiting free energy of the SK model in the entire high temperature phase (Corollary 
2.1), re-deriving the classical result of Aizenman et. al. [I], 

2. the limiting free energy of ROM for (5 sufficiently small (Corollary 1.3), which verifies pre¬ 
dictions of Marinari et al. [20], and 

3. the limiting free energy of the Gaussian Hopheld model with p/N -A A E (l,oo) for high 
temperatures, confirming non-rigorous calculations of Cherrier et al. [8]. We remark that 
our techniques should also apply to the case A E (0,1) but we restrict ourselves to the hrst 
case for clarity. 

1.1. Main Results. To state our main results we need to introduce some notations. The Haar 
measure on the orthogonal group 0(N ) will be denoted by dO, and the expectation of a function 
/ will be denoted by Eo/(0) := f Q /(O)dO. 

For any probability measure p, denote by supp the support of p. We will always consider 
probability measures with bounded support so that supp (p) C [A m i n , A max ]. To describe our results 
we need to introduce the Hilbert transform and the i?-transform of a probability measure with 
bounded support: 
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Definition 1.1. The Hilbert transform H tl of a measure /r is : M\supp(/i) -A M 

f (1.5) 

It is easy to show that H^ is a bijective map from R\supp(/i) to H m ax )\{0} (see [16]), where 

Umax = lim H min = lim H^z). (1.6) 

^J-Amax -^min 

Thus, Setting X m [ n = A m i n l/-f^min? ^max = A max l/-f^max!i £uid xn = J Ad/i^, for z G 
# M (M\suppO)), define the i?-transform : (# min , # max )\{0} -> (x min , x max )\{m} as 

H tl (R lt (z) + ±)=z. (1.7) 

It is easy to see that R p is bijective and we denote its inverse by Q Let 

I ^ = \f o R n( v ) dv - ( L8 ) 

Finally, for any j3 > 0, define 

1/. — 2/1(1 tanh/l(x max ^Cmin))) 1 /1 — 2/3(1 T tanh /l(x max ^min))- (1-9) 

We will restrict ourselves to models where the sequence of random empirical measures //*(D) = 
~k XA;=i corresponding to the matrix D in (1.2) satisfy certain “rigidity” properties. This allows 
us to neglect the fluctuations of the spectrum in the calculation of the free energy limit. We impose 
the following property on the law of the matrix D. 

Hypothesis 1. Let D = diag(di,d 2 , ■ • • , djv) be a (random) diagonal matrix with empirical mea¬ 
sure /qv(Zl) = JT =1 5d r Assume that 

(a) there exists a sequence of numbers ili/v = o(y/~N) such that 

lim PfllDlU > M n ) = 0, 

N-t oo 

where H-DHoo = max.i<i< N \di\; 

(b) there exists a deterministic measure i>n supported on N points in R such that for any c > 0, 

lim P (w 2 {hn{D),v n ) > TA 0 (1.10) 

N —>oo \ vIV J 

where W 2 (-, •) is the 2-Wasserstein distance between two probability measures. 

In most of our applications, it suffices to take y/v = * YliLi $E(di)- It can be easily checked that 
all our results continue to hold with any sequence of probability measures y/v satisfying Hypothesis 
1. However, we state our results with v * = Y2iLi &E(di) f° r clarity. We define, for any deterministic 
diagonal matrix A = diag(Ai, • • • , A *), 

Fjv(/3,A) = -^E 0 (log 2*09,0, A)). (Lll) 

Note that due to the invariance of the Haar measure on O(N), T*(/3, A) is only a function of the 
empirical distribution //*(A) = ]>T = , The following proposition establishes that we may 

neglect the fluctuations of the spectrum for the calculation of the free energy. 
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Proposition 1.1. Consider an orthogonal mean field spin glass model (1.2) with a (random) di¬ 
agonal matrix D = diag{d \, c^,..., d^). If the sequence of measures px{D) = YliL] E, satisfies 
Hypothesis 1, then 

\$ n (P,O,D)-T n (P,E(D))\$0. (1.12) 

The proof of Proposition 1.1 is outlined in Section 3.1. Given this result, to compute the limit 
of the free energy liiryv^oc O, D) it suffices to compute the limit of r>(/3,E (£>)). 

A crucial ingredient in the analysis of the asymptotics of Tjv(/3, A) is a connection with a spherical 
integral. Guionnet and Maida [ ] derived the asymptotics of these integrals in terms of the R- 

transform of the limit p of the empirical measure pn{A) = ^ 1 S\ i (refer to Section 1.2 for 
details). They assume the following conditions on the measure pw(A): 

Hypothesis 2. For a deterministic diagonal matrix A = diag(Ai, A 2 ,..., Ajv), denote by pn(A) = 
W E)Ii i the empirical measure of A. Assume that 

(a) the sequence of measures {pn(A)}n>i converges weakly to a compactly supported measure 
p, and 

(b) A min (A) := mini<j<jvAi and A max (A) := maxi<j<jv Aj converge to A min and A max which are 
finite. 

We will also assume Hypothesis 2 to determine the limit of the partition function Zn(/3, O, D). 
We have the following general result for the limiting free energy at high temperature. 

Theorem 1.2. Consider an orthogonal mean field spin glass model (1.2) with a (random) diagonal 
matrix D = diag{d\, di ,..., djy). Assume that 

(a) the sequence of (random) measures pn(D) = ^ EaIi E, satisfies Hypothesis 1, and 

(b) the sequence of deterministic measures /tat(E (D)) = ^ J2iLi E(d.,) h an d satisfies Hypothesis 

2 . 

(c) limsup^o (3 2 sup^gj^ ^j |i?'(/3o)| < \, where Ul and Ur are as defined in (1.9). 

Then for fi sufficiently small (depending on p), 

$jv(/3,O,£)4 409), (1.13) 

with defined in (1.8). 

As a consequence of the above theorem, we obtain the limiting free energy for many well-known 
models of disordered systems. Most importantly, we derive the limiting free energy of ROM (1.4) for 
fi sufficiently small (Corollary 1.3), which matches the predictions of Marinari et al. [20] obtained 
by non-rigorous methods. The limiting free energy for the case p = 1/2 is given in the following 
corollary. Refer to Proposition 2.2 for the expression for any p 6 (0,1). 

Corollary 1.3. For the random orthogonal model (ROM) with p = 1/2, for ft sufficiently small, 

llogZ 7V (/3,0, J D)4^v / 16/3 2 + l + log - 1 ) • ( L14 ) 

Using Theorem 1.2 we can also obtain the limiting free energy of the SK model in the entire 
high temperature phase (Corollary 2.1), re-deriving the classical result of Aizenman et al. [ ]. Our 
calculations also give the limiting free energy for the Gaussian Hopfield model at high temperatures, 
verifying non-rigorous calculations of Cherrier et al. [^]. 
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1.2. Proof Outline and Connections to Spherical Integrals. Spherical integrals over the 
orthogonal group 0(N) (also known as Harish Chandra-Itzykson-Zuber (HCIZ) integrals [6]) are 
integrals of the form 



exp (IV tv(OD nO t En))<10 , 


(1.15) 


where Dn and En are N x N diagonal matrices. HCIZ integrals have been studied due to their 
connection to matrix models and the enumeration of planar maps (refer [17] and the references 
therein). Asymptotics of spherical integrals was studied by Guionnet and Maida [16] in the regime 
where the rank of Dn is small compared to N. An alternative simpler proof was provided in [9]. 

To see the connection of such integrals to mean-field orthogonal spin glass models consider the an¬ 
nealed free energy of the model (1.2): 4>n((3, A) = logEo-Zjv(/3, O, A), where A = diag(Ai, A 2 ,..., A tv) 
is a deterministic diagonal matrix. Note that 

Z n (P, 0,A) = exp(/3a r OAO T a) = exp ^V/3tr joAO r • ( L16 ) 

at aeSN 


By the spectral decomposition = P a E\ \ P a where E\\ = diag(l, 0,..., 0). Using (1.16) and the 
invariance of the Haar distribution, 

E 0 (Z N ((5,O, A)) = E 0 exp (Nfitr {OAO T E u }) ■ (1.17) 

This is exactly of the form (1.15) with Djsr = A and En = E\\ = diag(l, 0,..., 0). Therefore, 
the annealed free energy (/>n(P, A) for any deterministic diagonal matrix A, is given by a spherical 
integral. The limit of 4>n(/3, A) was derived by Guionnet and Maida [16], when Hypothesis 2 holds: 


Theorem 1.4 (Guionnet and Maida [16]). Consider an orthogonal mean field spin glass model 
(1.2) with a deterministic diagonal matrix A = diag( Ai, A 2 ,..., Ajv)- If the sequence of empirical 

measures pn{A) = Ay —> p an d Hypothesis 2 holds, then for fi sufficiently small (depending 

on p) 

lim (j) N (/3, A) = I m (/3). (1-18) 

IV—»oo 


The proof of Theorem 1.2 proceeds as follows: when D is random in (1.2), then under Hypothesis 
1 we can replace the random matrix D by the deterministic matrix E (D). Theorem 1.2 then 
involves computing the limit of the annealed free energy 0 tv(/ 3,E(H)) using the above theorem, 
and the corresponding second moment. This together with results about concentration of measure 
gives the desired result. 


Remark 1.1. When D is random, another natural approach is to compute the total annealed free- 
energy = jj logE(Zjy(/3, O, D)), where the expectation is respect to the joint distribution 

of ( 0,D ). From (1.16) it is easy to see that 


<#■“"(/?) = i logE ( AT/S 


Eii x i . 


(1.18) 


where the AQ are i.i.d. A/"(0,1) random variables. 

It is expected that for /3 sufficiently small, this also gives the correct limit for the free energy. To 

this end, consider the random measure vn = YhiLi $d,, he., z'jv is a random discrete measure 




X'~ 

which assigns random weights to the random positions dk . Gamboa and Rouault [ ] derived 


E 
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a large deviation principle for the random measure z/jy, under certain technical assumptions on 
the sequence hn(D) = We believe that under these assumptions a second moment 

argument can be done to derive the high temperature limit of the free energy <&n(/3, O, D). However, 
this requires the full large deviation principle for the sequence {^at(-D)}at>i. On the other hand, we 
only need control on the tails of hn(D) in terms of the 2-Wasserstein distance, which is generally 
much easier to verify. 

1.3. Organization. The rest of the paper is organized as follows: The proof of Corollary 1.3 and 
the application of Theorem 1.2 to various other examples are given in Section 2. The proofs of 
Proposition 1.1 and Theorem 1.2 are given in Section 3.1 and Section 3.2, respectively. 

2. Examples 

In this section, we apply Theorem 1.2 to evaluate the limit of the free energy in various orthogonal 
mean-field spin glass models. 

2.1. The SK Model. Recall the definition of the SK-model introduced in Section 1. In this case, 
the coupling matrix J = W/y/N, where IT is a GOE matrix of order N. Thus J = ODO T , where 
O is Haar distributed and independent of D. It is a classical result in random matrix theory that 
Hn(D) = tJt my(D) converges almost surely to the Wigner semicircle law [ ] 

1 - 2 , 2 ]}. ( 2 . 1 ) 

Further, the edge of the empirical distribution converges to the edge of the semicircle law. 

An application of Theorem 1.2 yields the following corollary about the high temperature limit 
of the free-energy. It is well known that the SK model has a phase transition at (3 = 1/2. Our 
approach covers the whole high temperature region of the SK model, thus re-deriving the classical 
result of Aizenman et. al. [I]. 

Corollary 2.1. For the SK model with f3 < 1/2, limAn-oo yr log Z^(/3, O, D) f3 2 . 

Proof. In this case A m i n = —2, A max = 2. Using the density of the semi-circle law (2.1), the Hilbert 
transform can be easily computed to be H p (z ) = — \Jz 1 — 4) for z £ R\[—2,2]. This implies 

Umax = 1) H m [ n = —1, and x max = 1, x m \n = —1. Thus, using Definition 1.7, R p (z) = z on 
(— 1, 1)\{0}, Ip(z) = z 2 , and condition (c) in Theorem 1.2 holds trivially. This gives the desired 
conclusion subject to the verification of the other conditions of Theorem 1.2. 

It is well known that the measure //jv(E (D)) := ^E(di) satisfies Hypothesis 2 [ ]. Further, 

by [10, Corollary 4] there exists C > 0 such that 

E{W 2 (MD),MHD)))} < ( 2 . 2 ) 

Hypothesis 1 then follows using Markov’s inequality. 

To see that the second moment method employed in our proof works up to (3 < 1/2, see Remark 

3.1. □ 

2.2. The Random Orthogonal Model. In the random orthogonal model (ROM) introduced in 
Section 1 the coupling matrix J = ODO T , where D = diag(di, • • • , djy) is a deterministic sequence 
of {±1} such that the empirical measure hn(D) converges weakly to fi p := p5\ + (1 — p)S- 1 - 
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Proposition 2.2. For the random orthogonal model (ROM), there exists a p m > 0 such that for 

P < Pm, 


where m = 2p — 1. 


^ log Z N (ft 0,D) 4 


1 f 2j3 J\ + 4z(m + z) — 1 

— / ---d z. 

2 Jo 2z 


(2.3) 


Proof. In this case, the diagonal matrix D is deterministic. Thus, Hypothesis 1 holds trivially. 
Also, since the limiting measure p p is supported on two points, Hypotheses 2 is satisfied. 

In this case, A max = 1 and A m i n = —1. Moreover, by direct calculations Ff, Jp (z) = ffffj on 
(—oo, —1) U (l,oo). Thus, H m ax = oo and H m ; n = —oo which implies that 

R u P ( z ) = TT (V 1 + 4z(m + z) - l) 

is bijective from 1R\{0} to (—1, l)\{m}. 

To verify condition (c) in Theorem 1.2, recall the definition of Ul and Ur from (1.9). For 
Po E [Ul, Ur], there exists s(Po) such that |e(/3o)| < | tanh/3(x max — x m i n )| and po = 2/3(1 + e(/?o))- 
Thus, 


K P (Po) 


16/3*" (1 + e(Po))' 2 — 1 + -y/l + 4/3o {jn + Po ) 
16/3 2 (1 + e(/3o)) 2 \/l + 4/3 0 (m + /3 0 ) 


(2.4) 


From the above expression, it is easy to check that P 2 svl P/3 0 ^[u l ,u r ] r '^ p (Po) -a 0, as P -A 0. This 
verifies condition (c) and the result follows. □ 


The integral in (2.3) has a closed form expression, which can be easily computed. We refrain 
from writing this explicitly for notational clarity. However, for p = 1/2, in which case m = 0, (2.3) 
simplifies to the expression in Corollary 1.3. 

Remark 2.1. Marinari et al. [20] predicted that replica symmetry is broken in ROM with p = 1/2 
for p > 3.84. The exact location of symmetry breaking is, however, unclear. Corollary 1.3 shows 
that there exists a Po up to which the limit of free energy is given by the annealed limit. The value 
of Po can be calculated as follows: Let F(x, y) = P(x + y) + log cosh P(x + y), and 

(x*(P),y*(p)) := arg sup (F(x, y) - T^x) - T) t (y)), (2.5) 

where T^(z) := —|log(l — z 2 ), for z E [—1,1]. It is follows from the proof of Theorem 1.2 (see 
(3.29)) that po is largest P >0 such that the x*(P) = y*(P). Numerically solving the optimization 
problem (2.5) approximately gives Po < 2.7, proving that replica symmetry is preserved for P < 2.7. 

2.3. Gaussian Hopfield Model. In the Gaussian Hopfield model the coupling matrix J = 
^XX T , where X = ((X^-)) is a N x p matrix with i.i.d. Af(0, 1). For simplicity, we assume 
0 < ci < N/p < C 2 < 1. In this case, spectral distribution of J converges weakly almost surely to 
the Marchenko-Pastur law with density 

f(x)= V^A-Cs-l-A) 2 xg((1 _^) 2 ) ( 1 + v ^) 2 )) (2.6) 

2nx 

where p/N -A A. Thus, A m i n = (1 — \/A) 2 and A max = (1 + \/A) 2 in this example. 

Using the above density and Theorem 1.2 the limit of the free energy can be derived for high 
temperatures. 











BHASWAR B. BHATTACHARYA AND SUBHABRATA SEN 


Proposition 2.3. In the Gaussian Hopfield model, for f3 sufficiently small, 

1 


Jy log Zn(/3) 4 IfW) = ^log ^ 


(2.7) 


1 - 2/3 

Proof. The Hilbert Transform of the Marchenko-Pastur law ([18, Example 3.3.5]) is known to be 

x + 1 — A — y/(x — 1 — A) 2 — 4A 


H f {x) = 


2x 


( 2 . 8 ) 


Thus, in this example, H max = l/(l+\/A) and H mm = 1/(1 — n/A), which implies that .x max = A+-/A 
and x m i n = A — y/\. The i?-transform (1.7) is Rf(z) = A/(l — z). Hence, If (1.8) can be computed 
easily, which gives the formula in (2.7). Finally, to check condition (c) in Theorem 1 note that 

i?(:) = (1 -z) 2 ' 

The above representation implies that £(/3) —> 0 as /3 -A 0, thus verifying the required condition. 

The result now follows if the spectrum of the coupling matrix J satisfies Hypothesis 1. To this 
end, note that simple modifications of the arguments in [ 1, Corollary 2] yield the following: there 
exists a constant c > 0 such that 

(log /V) cloglogAr 


E(d 2 (/iN(D),E(nN(D)))) < 


N 


Hypothesis 1 follows by an application of Markov’s inequality. 

3. Proofs 


(2.9) 

□ 


3.1. Proof of Proposition 1.1. In this section the proof of Proposition 1.1 is presented. Fix 
6 > 0 and recall that &n(/3,0,D) = log Zn(/3,0, D). Therefore, by triangle inequality, 

F(|$ JV (^)-r iV (/3,E(T»))| >5) <Ti + T 2 , (3.1) 

where 


Ti = 


^ log Z N (P, O, D ) - ^E 0 log Z n ((3, O, D ) 


6 

>2 


and 


T 2 = 


^E 0 (logZ n (P,Q,D)) - ^Eolog Z N (P,0,E(D)) 


6 

>2 


(3.2) 


(3.3) 


We first control T 2 . By the rotational invariance of 0(N ), -^-Eo log Zn(/3, O, D) is actually a 
function of only the empirical distribution := where D = diag(di,-- - , <3/v). 

Thus, without loss of generality assume d\ > d 2 > ■ ■ ■ > d^■ Let O = [oi : 02 : • • • : ojv] be the 
columns of the matrix O. By the Cauchy-Schwarz inequality, 

N 


\q T ODO T a - o T OE(D)0 T o\ = 


^2(di -E (di))(a T Oi 


i —1 


< 


N^idi-mw 


(3.4) 


since (of 1 Oi) 2 = N, for all i E [N]. This implies that 


=j3VNW 2 (p N (D), m (E(D))), 


(3.5) 


















HIGH TEMPERATURE ASYMPTOTICS OF ORTHOGONAL MEAN-FIELD SPIN GLASSES 


9 


where the last step uses W 2 (pn{T)), p^(E(D))) = ^ YliLi(di ~ E(dj)) 2 • Therefore, 

To < F (w 2 (p N (D),E(p N (D))) > -A 0 (3.6) 

by Hypothesis 1, as N —> oo. 

It remains to control the first term Ti. For O £ O(N) and any fixed diagonal matrix A = 
diag(Ai, • • • ,Xn) define, 

p A(O) = ^iog exp(f3a T OAO T a). (3.7) 

CTGSjV 

Let 11A11 oo = maxi<j<7v |Aj|. Moreover, for any N x N symmertic matrix A, denote the spectral 
norm by ||A ||2 = sup^g^ and the Frobenius norm by ||A||p = (tr(A 2 )) 2 . It is easy to see 

that for Oi, O 2 £ 0(N ) and a unit vector x (that is ||x ||2 = 1), 

Ix^OiAOfx — x T 0 2 AOj x\ < |x t OiA(Oi — Oo) T x\ + \x T (Oi — 02)A0^x\ 

<2\\A\\ 00 \\O l -Oo\\ 2 

<2||A|| 00 ||0 1 -0 2 ||f’. (3.8) 


Thus, using (3.8), 


im)-m)i 


1 

N 


Z N (P,Oi, A) 
z N (p,o 2 ,A) 


< 2HAHOO/3IIO! - 0 2 \\ F . 


This implies F is Lipschitz with respect to the Frobenius norm. 

Sub-gaussian tail inequalities are known for Lipschitz functions on SO(N) (see Gromov and 
Milman [ ]). This can be used to complete the proof as follows: Now, let T be the operator 

which takes O £ SO(N ) and changes the sign of the first column of O. Clearly, for O £ SO(N), 
F(0) = F(TO). Let Pi and Ei be Haar measure and the expectation with respect it on SO(N), 
respectively. Thus, Eo(-Fd(0)) = Ei(Fd(0)), and recalling (3.2) and (3.7) it follows that 


Ti <EPi ( | F d (0) - E!(Fb(0))| > -, \\D IU < M n ) + P(||L»|U > M N ) 


<exp ( — 


CN5 2 


+ P(||T>||oo > M n ), 


P 2 m 2 , 

where C > 0 is a universal constant. By Hypothesis 1, the RHS above goes to zero as N 
Combining (3.6) and (3.9) with (3.1) the result follows. 


(3.9) 


00. 


3.2. Proof of Theorem 1.2. By concentration arguments identical to those used in controlling 
the term T\ in Proposition 1.1, the following lemma can be proved. 


Lemma 3.1. For any j3 > 0, there exists an universal constant c, independent of N. such that 

F(\<S> N (p,O,E(D))-E 0 $ N (p,O,E(D))\ > 6) < exp (—cN5 2 /(3 2 ). (3.10) 

The proof of Theorem 1.2 also requires computing the first and second annealed moments of 

z N (p,o,E(D))- 

Proposition 3.1. Under the assumptions of Theorem 1.2, for (3 sufficiently small (possibly de¬ 
pending on the limiting measure p), 

Jim ^AogE 0 (Z N (P,O,E(D)))= lim logE 0 (Z N (P, O, E(D)) 2 ). (3.11) 

TV—>oo TV 7V-»oo ZIV 
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The above lemma is the most challenging part of our argument and the proof is deferred to 
Section 3.2.1. 

The proof of Theorem 1.2 can be completed easily by combining Lemma 3.1 and Proposition 3.1 
with Theorem 1.4. To this end, set 70 = o E(D ))) 2 • R eca H the definition of the annealed free 

energy 

MAA) = ^logE 0 Z JV (/3 ) O,A). 

Then by [21, Lemma 4.1.1] 

f(\<S> n (P,0,E(D))-MPMD))\ < ^log7o) > (3.12) 

Also, note that Tat(/?, E(H)) = Eo<I>Ar(/3, O, E(D)). Thus, inequality (3.12) combined with Lemma 
3.1 gives 

Jim rjv(^,E(D)) = lim -J- logE 0 (Z N (P, 0,E(D))) = I M (/3), (3.13) 

Af->oo N-too iV 

where the last step uses Theorem 1.4. Finally, using Proposition 3.1, Theorem 1.2 follows. 

3.2.1. Proof of Proposition 3.1. For any function / : Sjy x Sj\r 1-7 M, denote by E] / (<t, r) = 
2 w So-reSjv /fen)) the expectation over the uniform measure over Sjy x Sjsr- Let A = E (D) = 
diag(Ai, A 2 , ■ • • , Ajv)- Therefore, 


E 0 (^at(/3, O, E(D)) 2 =Eq(Zn(/3, O, A) 2 ) 

= EiEoexp ( n /3 tr \oAO T + =0 j) 

= EiEoexp (Np I (l + ^ (OAO T ) n + (l - ^ (OAO T ) 22 j) , 

where we use the observation that the non-zero eigenvalues of (aa T + tt t )/N are (1 + ct i 't/N) and 
(1 — (J T t/N) respectively. Let V\ = ( OAO T )n and V 2 = (OAO T ) 22 . By interchanging the order of 
the expectation and observing that Eie A - - = (cosh A) iV , for any A 6 M, it follows that 

E 0 (Z N (p,0,A) 2 ) = E 0 (exp (NF(Vi, V 2 ))) (3.14) 


where F(x,y) = P(x + y) + log cosh P(x — y). 

The non-negativity of the log cosh function trivially implies that F(x,y) > P{x + y). Then by 
[16, Theorem 1.7], for p sufficiently small, we have 


Urn inf \ogE 0 (Z N (P, O, A) 2 ) > lim -j- logE 0 exp(NP(V! + V 2 )) 

N —^00 2A N->oo 2A 

= J im Fat logE 0 exp(M/3(Vi + V 2 )) = f t {P), 

N^-oo ZiV 


(3.15) 


where p is the limit of the empirical measure /tv(E (D)) := jj YliL} $E(di)- 

For the upper bound, let X = (X\,X 2 ,... ,XnY and Y = (Yi,Y 2 , ..., Yn ) 1 be i.i.d. jV(0,1). If 


Z= Y- 


(X,Y) 

(X,X) 


x= (z 1 ,z 2 ,...,z N y, 
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then 


(Vi,U 2 ) 


V Ehxf ’ Yd=i z i )' 


(3.16) 


Let V 2 ' = 




EiLi y? 


Note that Vi and Vf are independent, but Vi and U 2 are not. The following 


lemma shows that we can replace Vi by V.[ to get an upper bound: 


Lemma 3.2. Under the assumptions of Theorem 1.2, for any /3 > 0, 

iimsup —logE°(Ziv(/3, O, A ) 2 ) < lim logE 0 exp(A7F(Vi, U 2 )), (3.17) 

N^foo 2-N N^-oo 21\ 

where F(x,y) = f3(x + y) + log cosh/3(x — y). 


Proof. The lemma will be established using a “localization” argument similar to the one used in 
[16]. Fix k < 1/2 and 


Bn(k) = \ AT 




< N~ 


| N 1 N 

77 EU -1 <jv-«, w Y. X ‘ Y ‘ 


i— 1 


2=1 


< N~ 


(3.18) 


We adopt the following system of coordinates in IR 2iV : r, a^, ■ ■ • , are the polar coordinates 

( 2 ) ( 2 ) 

of X, n = II ill, pi is the angle between X and Y , and a[\ - ■ ■ ,ccjf_ 2 are the angles needed to 
spot Y on a cone of angle /3 2 around X. It is easy to see that (Vi, Vi) is a function of the cds while 
the event Bn(k) is determined by r and the /Ts. So (Vi, Vi) and -Bjv(n) are independent. 

Let I N = E 0 exp(NF(Vi,Vi)). By (3.14), ^7 logE 0 (Z N (/3,0, A) 2 ) = ^log/Ar. Therefore, to 
prove (3.17) it suffices to show that 

In < e(N, k)E 0 (1 Bn{k) exp (NF(V U V'))) (3.19) 

where e(N,k) < C(k) exp(A^ 1_2re ) for some constant C(k) and N sufficiently large. 

By bounding the moment generating functions of X\ and X\Y\ suitably in a neighborhood of 
zero, we get 


F(B n (k) 


N 


< 


N 


EF- 1 


2=1 


> n~ k +: 


< C'(k) exp (—c!V 1_2k ), 


1 N 

1 V y 2 
N 1 


- 1 


2=1 


> N~ 


+ - 


1 N 

nE x ‘ y < 


2=1 



(3.20) 


for some constants C'(n), c > 0 and N sufficiently large. Now, using the independence of (Vi, V 2 ) 
and Bn(k), 

In < ¥ ^M1 Bn ^MNF(V 1 ,Vi))) < £ (!V,«)Eo(l i3jvW exp(lVF(Vi,'V 2 ))). (3.21) 

By the Lipschitz property of the log cosh function \F(x,y) — F(x,z) \ < 2/3\y — z |. Therefore, 

In < e(N, k)E 0 (1 Bn{k] exp(IVF(Vi, V 2 ') + 2N/3\V 2 - V 2 '|)). (3.22) 

The upper bound in (3.19) follows if, on the set Bn(k), |U 2 — V 2 | < N~ K . To this end, note that 
on Bn(k), -^11 Y — Z\\ 2 < N~ K . Further, on Bn(k), 

^\Z t AZ- Y t AY\<^\\A\\ 00 \\Z\\\\Z- Y|| < N~ K , 


(3.23) 
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since HAHoo is finite by Hypothesis 2(b). 

From this it is easy to see that on the set Bjy(n), V 2 — Vj | < N~ K , and the proof is complete. □ 
Lemma 3.3. Under the assumptions of Theorem 1.2, for (3 > 0 sufficiently small, 

lim ^-\ogE 0 exp(NF(V u V')) = W)- (3-24) 

iV—KX) Z1M 

The proof of the above lemma is given below in Section 3.2.1. Note that the Lemma 3.3 together 
with (3.15) and (3.17) gives 

lim ^-logE 0 (Z N (f3,OMD)) 2 ) = lim -j- \ogE 0 (Z N (/3, O, E(D))) = 4(/3), (3.25) 

N^-oo 2N N^-oo Jy 

where the last equality uses Theorem 1.4. This completes the proof of Proposition 3.1. 


Proof of Lemma 3.3: The proof of this lemma follows from a large deviation result established 
in [16]. Recall the Hilbert transform and the R-transform of a probability measure v defined in 
(1.5), and (1.7), respectively. Denote the inverse of H v by K u , and that of R u by Q v . Refer to [16] 
for further details about the Hilbert and the R-transforms. 

Also, recall (1.6) 


— Ain ax 


1 


1 


and X m ; n — A m ; n „ , 

Umax rl min 


(3.26) 


where H max = lim,| Amax H v (z) and H mm = lim, tAmin H u (z). Finally, for k € (A min , A max ) c , define 

(3.27) 


h x (n) = [ log-—-dz/(A), 


K — X 


and hf m = lirn KtAm . n hjn) and hf ax = lirn KiAmax h x (n). 

The following proposition, proved in [ ], gives the large deviations rate function for the random 

variable V\. 


Proposition 3.4. ([16, Proposition 5.1]) If the sequence of non-random empirical measures //jv(A) = 
~N X)i=i — > l-t and satisfies Hypothesis 2, then the law of the random variables \ \ defined in (3.16) 
satisfies a large deviation principle with scale N and good rate function 


= < 


2 (*£))) if X £ [^minj ^max. 

if X GjXmaxj ^max| 
if X G] A m i n , X m j n [, 
otherwise. 


Umax 
2 tL x 


1 min 

2 L x 

OO 


(3.28) 


Since the empirical measures /r/v(A) satisfies Hypothesis 2, by Varadhan’s lemma [ ] we get 

Jim AT 7 lo g E oexp(AhF(Vi, Vf)) = \ sup (F(x,y) - T^(x) -T^y)), (3.29) 

N-too Zl\ Z x ,ye R 

where Tjt(-) is the good rate function of V\ and F(x, y) = /3(x + y) + log cosh /3(x — y ). 

Set tp(x,y) = F(x,y) — T fl (x) — T^(y). To prove Lemma 3.3 it suffices to establish that for /3 
sufficiently small, 

sup ip(x, y) = sup ip(x,y) = 2i /i (/3), (3.30) 

(x,y)e[x min^max] 2 


where x max and x m i n are as defined in (3.26). 
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To this end note that, 

dil) 

— = P + f3tanh./3(x - y) (3.31) 
dib 

-Q- = P~Ptanh P(x-y)-T fl (y). (3.32) 

We begin by showing that the maxima of p on the set [A m i n , A max ] 2 \[x m i n , x max ] 2 is attained on 
the boundary of the set [x m i n , x max ] 2 , for P < 4: For instance, for (x,y) G (x max , A max ] X 

[x-min,3-max] (as in Figure 1), by (3.28) T'^x) = = 2 (a 1 JL- 3 ;) • Tlien for P < #max/4, using 

tanh/3(x — y) < 1 and x > x' max , 

dp 1 

t r~(x,y) = P(1 + tanh/3(x — y)) - — -—— < 0. (3.33) 

OX ^\^max %) 

Therefore, p(x,y) < p(x ma x ,y) for (x,y) G (x max ,A max ] x [x min , x max ]. Other points in the region 
[Amin, A max ] 2 \[x min ,x max ] 2 can be dealt with similarly. 


(0,-W) 









(*. y ) 

(*«to,0) 


(0,0) 

(*—.0) 


«U»0 




(0,AJ 


(U,0) 


FIGURE 1. The domain of the function p. 


Next, we establish that for P small enough, the maxima of p on [x m j n , x max ] 2 cannot be attained 
on the boundary of [x min , x max ] 2 : For y G (x min , x max ), 
d 


Q P(x max -,y) = p(l + tanh/3(x max - y)) - \ H n 
ox 2 


< 0. 


if P < 4Ff max as in (3.33). This ensures that the maxima cannot be attained for x = x max . The 
same analysis implies that the maxima is not attained for y = x max . This establishes the required 
assertion. 

It remains to analyze the function p on (x m i n , x max ) 2 . Note that for x G (x m i n ,x max ), T' (x) = 


5<2 m (x). Therefore, from (3.31)-(3.32) any stationary point of p in (x : 
the system of equations 


mm? ^max 


) 2 is a solution of 


x = (2/3(1 + tanh/3(x — y))), (3.34) 

y = R fi (2P(l-tanhp(x-y))), (3.35) 

since R p is the inverse of Q p . 

Let a(P) be the solution of P = T p (-) = It is easy to verify that x*(/3) = y*(P) = a(p) is 

a critical point of p in (x m i n , x max ) 2 . It remains to show that for P sufficiently small the maximum 
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in (3.29) is attained at x*(f$) = y*(/3) = a(/3). This implies Lemma 3.3, since by [16, Lemma 5.7], 

To show that the maximum in (3.29) is attained at x*(/3) = y*(/3 ) = a(/3) we will show (in 
Lemma 3.5 below) that for /3 sufficiently small, the system of equations (3.34)-(3.35) has a unique 
solution (a(/3), a(/3)) which is a local maxima. This establishes that (a(/3), a(/3)) maximizes ip in 
(i miTi Jm^) 2 and (3.30) follows. 


Lemma 3.5. For (5 sufficiently small and R p satisfying condition (c) in Theorem 1.2, the system 
of equations (3.34)-(3.35) has a unique solution which is a local maximum. 


Proof. For (x,y) E [x min , x max ] 2 , define 


r t„ = ( ^( 2 /?(l +tanh/?(®- y))) 

,V \ Refill-tmhp(x-y))) 

For /3 < iLmax/4, G maps [x min ,x max ] 2 to [x min , x max ] 2 . Consider the set M = [x min ,x max ] 2 
equipped with the L\ metric, that is, di((xi, 2/i) ; (^ 2 , 2 / 2 )) = \x\ — X 2 I + \yi ~ 2 / 2 1- We note that 
(M, d\) is a complete metric space. Now, by the mean value theorem and using sech 2 < 1, 

\R/i (2/3(1 + tanh/3(aq - 2 / 1 ))) - R P (2/3(1 + tanh/3(x 2 - 2 / 2 )))! < 2f3 2 ((/3)di((xi, 2 / 1 ), (x 2 , 2 / 2 ))- 


(3.36) 


where ((/3) = P 2 su P/3 0 ^[u L ,u R ] |(/3o)| and Ul and Ur are as in (1.9). This implies, by condition 
(c) of Theorem 1.2, that G is a contraction. Thus, using the Banach Fixed Point Theorem [23], 
G has a unique fixed point. This implies (a(/3 ), a(/3)) is the unique fixed point of G , and the only 
stationary point of ip in (z m i n , x max ) 2 . 

Finally, we establish that the critical point (a(/3), a(/5)) is a local maxima of the function G. To 
see this note that the Hessian 

V 2 V>(x, y) = —diag(T"(x), T”(y)) + /3 2 sech 2 /3(x - y) ^ ^ . (3.37) 

Using T”(x) = 0R , (Q( c )) i we note that \7 2 ip(a((3), a(/3)) is negative definite provided (3 2 R'(2(3) < 

1/4. This condition is satisfied for f3 sufficiently small as condition (c) holds, thus completing the 
proof. □ 


Remark 3.1. In the SK model, R p (z) = z (defining R p ( 0) = 0), x max = 1, x m j n = —1, and p is 
the semi-circle law (2.1). Moreover, 


T p {x) 


2 - log (2 + x) 

X 2 

4 

k 2 _ lo g ( 2 ~ x ) 


x€ [-2,-1], 
x E [-1,1], 
[1,2]. 


(3.38) 


Note that T p (x) > x 2 /4 in [-2,-1] U [1,2]. Therefore, ip(x,y) < F(x,y ) — x — \ := V) f° r 
(x,y) E [—2,2] 2 . It is easy to see that (2/3, 2/3) E [—1, l] 2 , for (5 < 1/2, is a stationary point of tp. 
The Hessian \7 2 ip(x,y) is negative definitive for (x,y) E [—2, 2] 2 if 

(a) (V 2 -0(x, 2/))n = \ + /3 2 sech 2 f3(x — y) < 0, which holds whenever /? < 4^; and 

(b) det(V 2, i/!(x, 2 /)) = 2 + P 2 sech 2 /3(x — y) < 0, whenever /3 < 

Therefore, the maximum of ip is attained at (2/3,2/3) E [— 1,1] 2 , for /3 < 1/2, where the two 
functions ip and ip agree. Thus, for /3 < 1/2, which is the entire replica symmetric phase, the limit 
in Corollary 2.1 holds. 
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